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THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS: 

A method for data tabulation processing of a data file having a plurality of records in 
a {Plurality of data fields, comprising: 
5 \ i) a pre-processing stage in which, for each individual data field, each distinct 
data value is identified and allocated a numerical identifier unique for that field; and 

iri a tabulation stage in which, for each data record, a cell of a result array is 
determine?* based on the numerical id entifiers, for that record, and the result array cell 
incrementec 

10 

2. A method as claimed in claim L wherein the pre-processing stage includes generating 
from said data f life an encoded data file containing the numerical identifiers for the data values 
in each field, and V mapping file ^which stores a correspondence between each of the distinct 
data values in the fifelds and the corresponding numerical identifiers. 

15 

^vf" \ A method as claimed in cl *im 2, wherein a plurality of encoded data files are 
^ g^ner^ed in the pre-processing stage, one for each of the data fields. 

4. AVnethod as claimed in claim 2, including generating a tabulation result from said 
20 result arraAand said mapping file. 

5. A method as claimed in claim 1, wherein said tabulation stage includes initialising the 
result array having number of cells determined by the product of the number of numerical 
identifiers in the date fields. 

25 

6. A method as clain^ed in claim L wherein the tabulation stage includes selecting at least 
two data fields from the plurality of data fields for tabulation, and generating the result array 
utilising the numerical identifiers for the selected data fields. 

30 7. A method as claimed in clartn 6. wherein for N selected daia fields, a cell of the result 
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array is identified for each data record according to: 

I = K, + D X K 2 + D^ 2 K 3 +....+ D^.D^K* 
where I is the cell identity, 

K M K 2 , K N are the Numerical identifiers for the record in the selected fields, 
5 and D lf D 2 , D N . t are numbers of distinct values in the selected fields. 

8. A system for data tabulation processing of a data file having a plurality of records in 
a plurality of data fields, comprising: 

i) a coding processor in which, for each individual data field, each distinct data 
10 value is identified and allocated a numerical identifier unique for that field; and 

ii) a tabulation processor in which, for each data record, a cell of a result array 
is determined based on the numerical identifiers for that record, and the result array cell 
incremented. 

15 9. A system as claimed in claim 8, wherein said coding processor generates from said 
data file an encoded data file containing the numerical identifiers for the data values in each 
field, and a mapping file which stores a correspondence between each of the distinct data 
values in the fields and the corresponding numerical identifiers. 

20 10. A system as claimed in claim 8, Wrein a plurality of encoded data files are generated 
by the coding processor, one for each or the data fields. 

11. A system as claimed in claim 8, ^herein a selection of at least two data fields from 
the plurality of data fields is made in the tabulation processor, which generates the result array 

25 utilising the numerical identifiers for the selected data fields. 

12. A system as claimed in claim 11, wherein for N selected data fields, the tabulation 
processor identifies a cell of the result array Vor each data record according to: 

I = K, + D,K 2 + D X D 2 K 3 +....+ ^...Dk.Kn 
30 where I is the cell identity, 
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K, , K 2 K N are thfe numerical identifiers for the record in the selected fields, 

and D M D 2 , D N _, are numbers of distinct values in the selected fields. 
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